Fast Computation of Kernel Estimators
Abstract
The computational cost of evaluating a kernel density estimate (or its derivatives) at m evaluation points given n sample points scales as O(nm), making it prohibitively expensive for large data sets. While approximate methods such as binning can speed up the computation, they lack precise control over the accuracy of the approximation: there is no straightforward way of choosing the binning parameters a priori to achieve a desired approximation error. We propose a novel, computationally efficient ε-exact approximation algorithm for univariate Gaussian kernel density and density derivative estimation that reduces the computational complexity from O(nm) to linear O(n + m). The user specifies a desired accuracy ε, and the algorithm guarantees that the actual error between the approximation and the original kernel estimate is always less than ε. We also apply the proposed fast algorithm to speed up automatic bandwidth selection procedures. We compare our method to the best available binning methods in terms of speed and accuracy. Our experimental results show that the proposed method is almost twice as fast as the best binning methods and around five orders of magnitude more accurate. The software for the proposed method is available online.
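To make the cost comparison concrete, the following is a minimal univariate Python sketch that contrasts the exact O(nm) Gaussian KDE with a clustered, truncated-Taylor-series approximation in the spirit of fast Gauss transform methods. It is not the authors' released code: the function names, the grid-based clustering, and the fixed truncation order p and cutoff radius are illustrative assumptions, whereas the paper derives these parameters automatically from the requested tolerance ε.

```python
import numpy as np
from math import factorial

def kde_naive(x_eval, x_samp, h):
    """Exact Gaussian KDE, O(n*m) kernel evaluations."""
    u = (x_eval[:, None] - x_samp[None, :]) / h
    return np.exp(-0.5 * u**2).sum(axis=1) / (len(x_samp) * h * np.sqrt(2 * np.pi))

def kde_taylor(x_eval, x_samp, h, p=10, cutoff=6.0):
    """Approximate Gaussian KDE: grid clustering plus a p-term Taylor expansion."""
    # Assign each sample to a grid cell of width h and expand about the cell centre.
    lo = x_samp.min()
    idx = np.floor((x_samp - lo) / h).astype(int)
    n_clusters = int(idx.max()) + 1
    centers = lo + (np.arange(n_clusters) + 0.5) * h

    # Per-cluster moments A_k = sum_i exp(-t_i^2/2) t_i^k / k!, with t_i = (x_i - c)/h.
    t = (x_samp - centers[idx]) / h
    w = np.exp(-0.5 * t**2)
    A = np.stack([np.bincount(idx, weights=w * t**k, minlength=n_clusters) / factorial(k)
                  for k in range(p)], axis=1)

    # Factorization used: exp(-(y - x_i)^2 / (2h^2))
    #   = exp(-u^2/2) * exp(-t^2/2) * sum_k (u*t)^k / k!,  with u = (y - c)/h.
    out = np.zeros(len(x_eval))
    for j, c in enumerate(centers):
        u = (x_eval - c) / h
        near = np.abs(u) < cutoff   # clusters farther than cutoff*h contribute negligibly
        if near.any():
            un = u[near]
            series = sum(A[j, k] * un**k for k in range(p))
            out[near] += np.exp(-0.5 * un**2) * series
    return out / (len(x_samp) * h * np.sqrt(2 * np.pi))

rng = np.random.default_rng(0)
x = rng.normal(size=5000)
grid = np.linspace(-4.0, 4.0, 2000)
err = np.max(np.abs(kde_naive(grid, x, 0.2) - kde_taylor(grid, x, 0.2)))
print("max abs deviation from exact KDE:", err)  # small for these illustrative settings
```

For simplicity the evaluation loop scans every cluster and masks distant points; a strict O(n + m) bound, as in the paper, additionally requires indexing each evaluation point to only its neighboring clusters and choosing p and the cutoff from ε.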
Similar resources
Efficient Non-parametric Estimation of Probability Density Functions
Accurate and fast estimation of probability density functions is crucial for satisfactory computational performance in many scientific problems. When the type of density is known a priori, the problem becomes statistical estimation of parameters from the observed values. In the non-parametric case, the usual estimators make use of kernel functions. If X_j, j = 1, 2, ..., n is a sequence of ...
Gamma Kernel Intensity Estimation in Temporal Point Processes
In this article we propose a nonparametric approach for estimating the intensity function of temporal point processes based on kernel estimators. In particular we use asymmetric kernel estimators characterized by the gamma distribution, in order to describe features of observed point patterns adequately. Some characteristics of these estimators are analyzed and discussed both through simulated ...
Asymptotic Behaviors of Nearest Neighbor Kernel Density Estimator in Left-truncated Data
Kernel density estimators are the basic tools for density estimation in non-parametric statistics. The k-nearest neighbor kernel estimators represent a special form of kernel density estimators, in which the bandwidth is varied depending on the location of the sample points. In this paper, we initially introduce the k-nearest neighbor kernel density estimator in the random left-truncatio...
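For illustration, here is a hedged sketch (not taken from the cited paper) of a k-nearest-neighbor kernel density estimator: the bandwidth at each evaluation point is the distance to its k-th nearest sample, so smoothing adapts to the local data density. The function name, the Gaussian kernel choice, and the value of k are assumptions, and the paper's treatment of left-truncated data is omitted.

```python
import numpy as np

def knn_kde(x_eval, x_samp, k=30):
    """Gaussian kernel estimate with bandwidth h(x) = distance to the k-th nearest sample."""
    d = np.abs(x_eval[:, None] - x_samp[None, :])   # pairwise distances
    h = np.sort(d, axis=1)[:, k - 1]                # k-th nearest-neighbor distance per point
    u = d / h[:, None]
    return np.exp(-0.5 * u**2).sum(axis=1) / (len(x_samp) * h * np.sqrt(2 * np.pi))

x = np.random.default_rng(1).standard_normal(1000)
grid = np.linspace(-3.0, 3.0, 200)
density = knn_kde(grid, x, k=30)
```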
Comparison of the Gamma kernel and the orthogonal series methods of density estimation
The standard kernel density estimator suffers from boundary bias when estimating probability density functions of distributions on the positive real line. The Gamma kernel estimators and orthogonal series estimators are two alternatives which are free of boundary bias. In this paper, a simulation study is conducted to compare the small-sample performance of the Gamma kernel estimators and the orthog...
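A minimal sketch of one standard gamma-kernel form (shape x/b + 1, scale b) for data on [0, ∞); the exact variant and the orthogonal-series competitor compared in the paper are not reproduced here, and the function name and the smoothing parameter b are illustrative assumptions. Because each gamma kernel is supported on the positive half-line, no probability mass leaks below zero, which is the source of the usual boundary bias.

```python
import numpy as np
from scipy.stats import gamma

def gamma_kde(x_eval, x_samp, b=0.05):
    """Boundary-bias-free density estimate on [0, inf) using gamma kernels."""
    shapes = x_eval / b + 1.0                                   # one gamma shape per evaluation point
    pdfs = gamma.pdf(x_samp[None, :], a=shapes[:, None], scale=b)
    return pdfs.mean(axis=1)

x = np.random.default_rng(2).exponential(scale=1.0, size=2000)
grid = np.linspace(0.0, 5.0, 200)
density = gamma_kde(grid, x, b=0.05)
```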
Deriving Kernels from MLP Probability Estimators
In multi-class categorization problems with a very large or unbounded number of classes, it is often not computationally feasible to train and/or test a kernel-based classifier. One solution is to use a fast computation to pre-select a subset of the classes for reranking with a kernel method, but even then tractability can be a problem. We investigate using trained multilayer perceptron probabi...
Publication year: 2009